Search CORE

54 research outputs found

Using GeneReg to construct time delay gene regulatory networks

Author: AA Margolin
AJ Butte
DA Orlando
G Karlebach
GAF Seber
J Yu
JM Chambers
K Basso
K Venkatesan
Kang Tu
KC Chen
Lei Liu
Lu Xie
M Bansal
MC Teixeira
S Kim
SS Dwight
T Radivoyevitch
Tao Huang
TJ Hastie
Y Sakamoto
Yixue Li
Ziliang Qian
Publication venue: BioMed Central
Publication date: 01/05/2010
Field of study

Abstract Background Understanding gene expression and regulation is essential for understanding biological mechanisms. Because gene expression profiling has been widely used in basic biological research, especially in transcription regulation studies, we have developed GeneReg, an easy-to-use R package, to construct gene regulatory networks from time course gene expression profiling data; More importantly, this package can provide information about time delays between expression change in a regulator and that of its target genes. Findings The R package GeneReg is based on time delay linear regression, which can generate a model of the expression levels of regulators at a given time point against the expression levels of their target genes at a later time point. There are two parameters in the model, time delay and regulation coefficient. Time delay is the time lag during which expression change of the regulator is transmitted to change in target gene expression. Regulation coefficient expresses the regulation effect: a positive regulation coefficient indicates activation and negative indicates repression. GeneReg was implemented on a real Saccharomyces cerevisiae cell cycle dataset; more than thirty percent of the modeled regulations, based entirely on gene expression files, were found to be consistent with previous discoveries from known databases. Conclusions GeneReg is an easy-to-use, simple, fast R package for gene regulatory network construction from short time course gene expression data. It may be applied to study time-related biological processes such as cell cycle, cell differentiation, or causal inference.</p

Crossref

Directory of Open Access Journals

PubMed Central

Efficient and accurate greedy search methods for mining functional modules in protein interaction networks

Author: A Gavin
B Adamcsek
Baoliu Ye
BS Everitt
C Brun
Chaojun Li
DJ Watts
F Luo
F Radicchi
G Palla
GD Bader
H Jeong
H Leung
HW Mewes
I Xenarios
J Wang
J Wang
J Wang
Jieyue He
L Gao
LF Wu
M Altaf-Ul-Amin
M Girvan
M Li
M Li
M Wu
MEJ Newman
SH Jung
SS Dwight
V Spirin
Wei Zhong
X Li
YR Cho
Z Dezso
Publication venue: BioMed Central
Publication date: 01/06/2012
Field of study

Abstract Background Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain Core/attachment structures. Methods In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches, however it is computationally expensive. Many module detection approaches are based on the traditional hierarchical methods, which is also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge weight based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into the suitable set of modules. Results The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate as compared to other competing algorithms. Conclusions Based on the new edge weight definition, the proposed algorithm takes advantages of the greedy search procedure to separate the network into the suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the computational time significantly while keeping high prediction accuracy.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Using a logical model to predict the growth of yeast

Author: KE Whelan
RD King
RD King
CH Bryant
PGK Reiser
G Giaever
NC Duarte
J Förster
H Kitano
ME Csete
L Chong
EH Davidson
M Kanehisa
M Kanehisa
PD Karp
X Feng
P Mendes
P Mendes
P Mendes
M Tomita
JS Edwards
R Mahadevan
ND Duarte
KJ Kauffman
D Segre
T Shlomi
J Stelling
N Lemke
N Lemke
I Bratko
F Fages
C Gershenson
S Kauffman
B Kuipers
RD King
PA Flach
J Förster
S Muggleton
SS Dwight
E Gasteiger
RG Sargent
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

Abstract Background A logical model of the known metabolic processes in <it>S. cerevisiae </it>was constructed from iFF708, an existing Flux Balance Analysis (FBA) model, and augmented with information from the KEGG online pathway database. The use of predicate logic as the knowledge representation for modelling enables an explicit representation of the structure of the metabolic network, and enables logical inference techniques to be used for model identification/improvement. Results Compared to the FBA model, the logical model has information on an additional 263 putative genes and 247 additional reactions. The correctness of this model was evaluated by comparison with iND750 (an updated FBA model closely related to iFF708) by evaluating the performance of both models on predicting empirical minimal medium growth data/essential gene listings. Conclusion ROC analysis and other statistical studies revealed that use of the simpler logical form and larger coverage results in no significant degradation of performance compared to iND750.</p

Crossref

Aberystwyth Research Portal

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository

Enlighten

Explore Bristol Research

pubmed2ensembl: A Resource for Mining the Biological Literature on Genes

Author: A Doms
AA Morgan
AM Jenkinson
B Giardine
BA Eckman
C Plake
Casey M. Bergman
D Hull
D Maglott
D Smedley
E Ryder
EM Zdobnov
G Zhou
Goran Nenadic
H Miller
H Parkinson
J Hakenberg
J Hirschman
J Tamames
JM Fernandez
Joachim Baran
L Chen
L Hirschman
M Ashburner
M Gerner
M Haeussler
M Huang
M Krallinger
M Krallinger
Martin Gerner
Maximilian Haeussler
P Flicek
P Kersey
PA Fujita
R Drysdale
R Hoffmann
R Leinonen
R Lyne
RC Gentleman
S Matos
SM Gallo
SP Shah
SS Dwight
Stein Aerts
T Imanishi
TJ Lee
U Mudunuri
W Xuan
Y Makita
Y Yoshida
Z Lu
Publication venue: Public Library of Science
Publication date: 29/09/2011
Field of study

The last two decades have witnessed a dramatic acceleration in the production of genomic sequence information and publication of biomedical articles. Despite the fact that genome sequence data and publications are two of the most heavily relied-upon sources of information for many biologists, very little effort has been made to systematically integrate data from genomic sequences directly with the biological literature. For a limited number of model organisms dedicated teams manually curate publications about genes; however for species with no such dedicated staff many thousands of articles are never mapped to genes or genomic regions.To overcome the lack of integration between genomic data and biological literature, we have developed pubmed2ensembl (http://www.pubmed2ensembl.org), an extension to the BioMart system that links over 2,000,000 articles in PubMed to nearly 150,000 genes in Ensembl from 50 species. We use several sources of curated (e.g., Entrez Gene) and automatically generated (e.g., gene names extracted through text-mining on MEDLINE records) sources of gene-publication links, allowing users to filter and combine different data sources to suit their individual needs for information extraction and biological discovery. In addition to extending the Ensembl BioMart database to include published information on genes, we also implemented a scripting language for automated BioMart construction and a novel BioMart interface that allows text-based queries to be performed against PubMed and PubMed Central documents in conjunction with constraints on genomic features. Finally, we illustrate the potential of pubmed2ensembl through typical use cases that involve integrated queries across the biomedical literature and genomic data.By allowing biologists to find the relevant literature on specific genomic regions or sets of functionally related genes more easily, pubmed2ensembl offers a much-needed genome informatics inspired solution to accessing the ever-increasing biomedical literature

Crossref

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository

Analysis and Prediction of Translation Rate Based on Sequence and Functional Features of the mRNA

Author: AR Gruber
C Chothia
C Ding
D Charif
D Greenbaum
F Gebauer
G Kudla
G Lithwick
G Pollastri
G Pollastri
Grzegorz Kudla
GV Glass
H Liljenstrom
H Peng
Hai-Peng Li
I Dubchak
JE Bergmann
JL Fauchere
JP Le Quesne
Kai-Yan Feng
KC Chou
KC Chou
KC Chou
KC Chou
L Nie
LJ Jensen
M Charton
M Ringner
MA Gilchrist
MP Washburn
NT Ingolia
O Shalem
P Carmona-Saez
P Lu
PM Sharp
Q Tian
R Brockmann
R Grantham
S Galban
S Ghaemmaghami
S Varenne
Sibao Wan
SP Gygi
SS Dwight
T Huang
T Huang
T Huang
T Kawai
T Tuller
T Tuller
Tao Huang
Xiangyin Kong
Y Osada
Yu-Dong Cai
Yufang Zheng
Zhongping Xu
Publication venue: Public Library of Science
Publication date: 06/01/2011
Field of study

Protein concentrations depend not only on the mRNA level, but also on the translation rate and the degradation rate. Prediction of mRNA's translation rate would provide valuable information for in-depth understanding of the translation mechanism and dynamic proteome. In this study, we developed a new computational model to predict the translation rate, featured by (1) integrating various sequence-derived and functional features, (2) applying the maximum relevance & minimum redundancy method and incremental feature selection to select features to optimize the prediction model, and (3) being able to predict the translation rate of RNA into high or low translation rate category. The prediction accuracies under rich and starvation condition were 68.8% and 70.0%, respectively, evaluated by jackknife cross-validation. It was found that the following features were correlated with translation rate: codon usage frequency, some gene ontology enrichment scores, number of RNA binding proteins known to bind its mRNA product, coding sequence length, protein abundance and 5′UTR free energy. These findings might provide useful information for understanding the mechanisms of translation and dynamic proteome. Our translation rate prediction model might become a high throughput tool for annotating the translation rate of mRNAs in large-scale

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae

Background: Probabilistic functional gene networks are powerful theoretical frameworks for integrating heterogeneous functional genomics and proteomics data into objective models of cellular systems. Such networks provide syntheses of millions of discrete experimental observations, spanning DNA microarray experiments, physical protein interactions, genetic interactions, and comparative genomics; the resulting networks can then be easily applied to generate testable hypotheses regarding specific gene functions and associations. Methodology/Principal Findings: We report a significantly improved version (v. 2) of a probabilistic functional gene network [1] of the baker's yeast, Saccharomyces cerevisiae. We describe our optimization methods and illustrate their effects in three major areas: the reduction of functional bias in network training reference sets, the application of a probabilistic model for calculating confidences in pair-wise protein physical or genetic interactions, and the introduction of simple thresholds that eliminate many false positive mRNA co-expression relationships. Using the network, we predict and experimentally verify the function of the yeast RNA binding protein Puf6 in 60S ribosomal subunit biogenesis. Conclusions/Significance: YeastNet v. 2, constructed using these optimizations together with additional data, shows significant reduction in bias and improvements in precision and recall, in total covering 102,803 linkages among 5,483 yeast proteins (95% of the validated proteome). YeastNet is available from http://www.yeastnet.org.This work was supported by grants from the N.S.F. (IIS-0325116, EIA-0219061), N.I.H. (GM06779-01,GM076536-01), Welch (F-1515), and a Packard Fellowship (EMM). These agencies were not involved in the design and conduct of the study, in the collection, analysis, and interpretation of the data, or in the preparation, review, or approval of the manuscript.Cellular and Molecular Biolog

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

Gis1 and Rph1 Regulate Glycerol and Acetate Metabolism in Glucose Depleted Yeast Cells

Author: A Boorsma
A Huber
A Mesquita
A Reinders
AA Falcon
AI Saeed
AP Gasch
AP Schmitt
B Mai
C Allen
C Cheng
C Koch
C Zhu
CR Burtner
D Balciunas
D Balciunas
DE Harrison
DH Nguyen
E Cameroni
E Swinnen
EJ Masoro
F Estruch
G Badis
GK Smyth
H Parkinson
H Takahashi
Hans Ronne
HJ Bussemaker
I Pedruzzi
I Pedruzzi
Ida Olsson
J Fang
J Feser
J Ma
J Norbeck
J Oshiro
J Roosen
J van Helden
J Westholm
Jakub Orzechowski Westholm
Jan Komorowski
JD Hughes
JD Lieb
JL DeRisi
JM Treger
JP Navarro-Aviño
JS Hardwick
Kirsten Nielsen
KK Steffen
L Fontana
L Gautier
M Dequard-Chablat
M Kaeberlein
M Kaeberlein
M Kaeberlein
M Kanehisa
M Lundin
M Wei
M Wei
M Weinberger
MA Beer
MT Martinez-Pastor
N Zhang
N Zhang
Niklas Nordberg
O Medvedik
P Fabrizio
P Fabrizio
P Fabrizio
RC Gentleman
RJ Colman
RJ Klose
RJ Klose
RW Powers 3rd
S Tu
SS Dwight
Susanna Tronnersjö
T Kim
VM Boer
W Dang
W Gorner
WC Burhans
WH White
X Liu
Y Benjamini
Y Chang
Y Tsukada
Y Yu
YK Jang
Z Wu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Aging in organisms as diverse as yeast, nematodes, and mammals is delayed by caloric restriction, an effect mediated by the nutrient sensing TOR, RAS/cAMP, and AKT/Sch9 pathways. The transcription factor Gis1 functions downstream of these pathways in extending the lifespan of nutrient restricted yeast cells, but the mechanisms involved are still poorly understood. We have used gene expression microarrays to study the targets of Gis1 and the related protein Rph1 in different growth phases. Our results show that Gis1 and Rph1 act both as repressors and activators, on overlapping sets of genes as well as on distinct targets. Interestingly, both the activities and the target specificities of Gis1 and Rph1 depend on the growth phase. Thus, both proteins are associated with repression during exponential growth, targeting genes with STRE or PDS motifs in their promoters. After the diauxic shift, both become involved in activation, with Gis1 acting primarily on genes with PDS motifs, and Rph1 on genes with STRE motifs. Significantly, Gis1 and Rph1 control a number of genes involved in acetate and glycerol formation, metabolites that have been implicated in aging. Furthermore, several genes involved in acetyl-CoA metabolism are downregulated by Gis1

Public Library of Science (PLOS)

Crossref

Publikationer från Uppsala Universitet

Directory of Open Access Journals

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

FigShare

The Princeton Protein Orthology Database (P-POD): A Comparative Genomics Analysis Tool for Biologists

Author: A Alexeyenko
A Alexeyenko
A Chatterjee
A Shaag
AB Clark
AJ Herr
AK Agarwal
AS Payne
B Garavaglia
B Samuel Lattimore
Berend Snel
C Garbers
C Srinivasan
CA Hu
CE Grubenmann
CG Frank
CH Kocken
Charles Lu
CJ Loewen
CJ Penkett
DA Benson
DA Pearce
David Botstein
DC Gowda
DJ Kelleher
DL Wheeler
E Catoni
EI Boyle
EJ Vonarx
EV Koonin
F Chen
F Chen
F Liang
Fan Kang
G Hsi
G Schaffar
GF Xu
H Bussey
HS Feiler
I Mayordomo
J Archambault
J Brzeski
J Gecz
J Jantti
J Lenffer
JD Thompson
JF Mercer
JJ Heinisch
K Lai
K Lillard-Wetherell
K Okada
K Yamagata
Kara Dolinski
KP O'Brien
KP O'Brien
L Covic
L Desmyter
L Li
M Forsgren
M Geisler
M Raymond
M Schiott
M Schwarz
M Takeuchi
ME Lucas
MH Kedees
Michael S. Livstone
MM Lanterman
MT Geraghty
N Mamiya
N Raben
N Wagner
NF Neff
O Johnstone
Owen White
P Cavadini
P Poullet
P Sung
PA Colussi
PG Morgante
PJ Keeling
PJ Schmidt
PM Krumpelman
R Ballester
R Boyum
R Jothi
R Kellermayer
R Mancini
R Portmann
R Tommasini
RD Saunders
RK McEwen
RK Raymond
RL Tatusov
Rose Oughtred
S Hofmann
S Nomoto
S Roje
S Tomita
S van Wilpe
S Willingham
Samuel V. Angiuoli
SJ Kron
SK Dutcher
SN Guzder
SS Dwight
Sven Heinicke
T Kataoka
T Kleinow
T Kulikova
T Morita
T Sone
U Rothbauer
V Lumbreras
VK Ton
VK Ton
WK Schmidt
WY Song
XD Gao
Y Chen
Y Kida
Y Lee
Y Onodera
Y Sambongi
Z Peng
Publication venue: Public Library of Science
Publication date: 01/08/2007
Field of study

Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic relationships among predicted orthologs (based on the OrthoMCL method) to a query gene from any of eight eukaryotic organisms, and to see the orthologs in a wider evolutionary context (based on the Jaccard clustering method). In addition to the phylogenetic information, the database contains experimental results manually collected from the literature that can be compared to the computational analyses, as well as links to relevant human disease and gene information via the OMIM, model organism, and sequence databases. Our aim is for the P-POD resource to be extremely useful to typical experimental biologists wanting to learn more about the evolutionary context of their favorite genes. P-POD is based on the commonly used Generic Model Organism Database (GMOD) schema and can be downloaded in its entirety for installation on one's own system. Thus, bioinformaticians and software developers may also find P-POD useful because they can use the P-POD database infrastructure when developing their own comparative genomics resources and database tools

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Gene Ontology annotations and resources.

Author: Alam-Faruque Y
Apweiler R
Auchinchloss A
Axelsen K
Bahler J
Balakrishnan R
Basu S
Bely B
Berardini TZ
Binkley G
Blake JA
Blatter M-C
Bonilla C
Bouguerleret L
Boutet E
Breuza L
Bridge A
Bridges S
Brown NH
Burgess S
Buza T
Carbon S
Chan J
Chan WM
Chang H-Y
Chavali G
Cherry JM
Chibucos M
Chrisholm RL
Costanzo MC
Coudert E
D'Eustachio P
Dietze H
Dimmer E
Dolan M
Drabkin H
Dwight SS
Dwinell M
Engel SR
Estreicher A
Famiglietti L
Feuermann M
Fey P
Fisk DG
Foulgar R
Gaudet P
Gene Ontology Consortium
Giglio M Gwinn
Gos A
Gruaz-Gumowski N
Harris MA
Hayman T
Hieta R
Hill DP
Hinz C
Hitz BC
Hong EL
Howe D
Hu JC
Huala E
Hulo C
Hunter S
Huntley R
Ireland A
James J
Jungo F
Karra K
Keller G
Kersey PJ
Khodiyar VK
Kibbe WA
Kishore R
Laiho K
Laulederkind S
Legge D
Lemercier P
Lewis SE
Li Ni
Lieberherr D
Lock A
Lomax J
Lovering RC
Lowry T
Magrane M
Martin MJ
Masson P
Matthews L
McAnulla C
McCarthy F
McDowall DM
McIntosh BK
Mi H
Mitchell A
Miyasato SR
Mungall CJ
Mutowo-Muellenet P
Nash RS
O'Donovan C
Oliver SG
Park J
Peddinti D
Pedruzzi I
Petri V
Pichler K
Pillai L
Poggioli D
Porras Millán P
Poux S
Renfro DP
Rivoire C
Roechert B
Roncaglia P
Rutherford K
Sangrador A
Sawford T
Schneider M
Shimoyama M
Siegele DA
Sitnikov D
Skrzypek MS
Staines DM
Stephan R
Sternberg P
Stutz A
Sundaram S
Talmud PJ
Thomas PD
Tognolli M
Tweedie S
Van Auken K
Wang S-J
Weng S
Westerfield M
Wong ED
Wood V
Xenarios I
Zweifel AE
Publication venue: Nucleic Acids Res
Publication date: 01/01/2013
Field of study

The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources

UCL Discovery

Apollo (Cambridge)

Assessment and refinement of eukaryotic gene structure prediction with gene-structure-aware multiple protein sequence alignment

Author: A Nagy
A Nagy
A Sali
BB Wang
BE Bernstein
BJ Haas
C Liang
D Haussler
David R Nelson
DJ Russell
DM Goodstein
DR Nelson
DR Nelson
DT Jones
DT Jones
DT Jones
E Birney
F Morcos
H Iwata
H Iwata
H Kamisetty
H Nagasaki
I Verde
IB Rogozin
IM Meyer
IV Grigoriev
J Harrow
J Soding
JC Estill
JE Allen
JL Ashurst
K Katoh
K Yook
M Gribskov
M Stanke
M Yandell
Mariko Morita
MO Dayhoff
O Gotoh
O Gotoh
O Gotoh
O Gotoh
O Gotoh
O Gotoh
O Gotoh
O Gotoh
Osamu Gotoh
PS Schnable
Q Dong
R Madupu
S Yamada
S Yamada
SF Altschul
SP Verma
SS Dwight
V Curwen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref